Method and system for performing simplified troubleshooting procedures to isolate connectivity problems

ABSTRACT

A method and apparatus for implementing troubleshooting of a network connectivity problem between a client computer coupled to a local switch and an end point on the network utilizes a client-Proxy module instantiated on the local switch. The module automatically runs a series of tests utilizing the IP and MAC addresses of the client computer source addresses and reports the results of the tests.

BACKGROUND OF THE INVENTION

Switches and routers provide a broad set of troubleshooting tools and utilities such as, for example, ping, Layer 3 traceroute, Layer 2 traceroute, etc., that can be combined with the output of various commands to debug network connectivity problems.

However, debugging can become quite challenging for users who are not network specialists. Even for the most basic connectivity problems, it is necessary to go through a step by step process to validate the connectivity checks and isolate the problem.

A typical example of a connectivity problem is depicted in FIG. 1 where a client computer 10, coupled to a port of a Local Switch 12, is unable to connect to an end station host server 14 located on the network 16. Debugging the problem involves running utilities such as ping and traceroute from the client computer.

Ping is a utility to determine whether a specific Internet Protocol (IP) address is accessible. It works by sending a packet to the specified address and waiting for a reply. Ping is used primarily to troubleshoot network connections. Traceroute utilities work by sending packets with low time-to-live (TTL) fields. The TTL value specifies how many hops the packet is allowed before it is returned. When a packet can not reach its destination because the TTL value is too low, the last host returns the packet and identifies itself. By sending a series of packets and incrementing the TTL value with each successive packet, traceroute finds out who all the intermediary hosts are.

These troubleshooting tools and utilities must be initiated at the client's computer because the connectivity problem occurs somewhere along the path taken by packets between the client and end station host server. This requires that the network administrator (the “Admin”) be physically present at the client computer to run the tests or remotely connect with the user to guide her through performing the steps on the client computer.

Thus, either the user is diverted from other tasks in order to assist the Admin or the Admin must move from computer to computer to debug connectivity problems.

The challenges in the field of network administration continue to increase with demands for more and better techniques having greater flexibility and adaptability. Therefore, a need has arisen for a new system and method for debugging connectivity problems between a client computer and an end station host server connected to a network.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a view of a system suitable for implementing an embodiment of the invention;

FIG. 2 is a flow chart depicting tests and analysis required to debug a connectivity problem;

FIG. 3 is a flow chart depicting steps performed by an embodiment of the invention; and

FIG. 4 is a block diagram of a network device configured to implement an embodiment of the invention.

DETAILED DESCRIPTION OF THE INVENTION

Reference will now be made in detail to various embodiments of the invention. Examples of these embodiments are illustrated in the accompanying drawings. While the invention will be described in conjunction with these embodiments, it will be understood that it is not intended to limit the invention to any embodiment. On the contrary, it is intended to cover alternatives, modifications, and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the various embodiments. However, the present invention may be practiced without some or all of these specific details. In other instances, well known process operations have not been described in detail in order not to unnecessarily obscure the present invention.

One embodiment of the invention is a simplified interface that helps in troubleshooting connectivity problems. The interface does not necessarily point to the root cause of the problem, but helps isolate the problem. In computer networks without well-qualified networking administration personnel to support the network, as is common in small medium businesses, this embodiment makes it easier to troubleshoot connectivity problems.

Referring again to FIG. 1, client computer A 10 has problems connecting to an endpoint host server 14 located somewhere in the network 16. The client computer A 10 is directly connected to Ethernet switch 12 marked as “Local Switch” in FIG. 1. In typical networks, the user of client computer A would seek the help of the Admin to help troubleshoot the connectivity problem. The Admin would have to walk user through a series of steps that debug the problem as shown in the flowchart of FIG. 2.

In existing networks the Admin must perform the various tests in the decision blocks of the flowchart from the client computer A. Alternatively, the Admin can guide the user of client computer A through the tests.

FIG. 3 shows a high-level, system block diagram of a Local Switch that may be used to execute software of an embodiment of the invention. The Local Switch includes a memory 120 which can be utilized to store and retrieve software and data for use by the software. Exemplary computer readable storage media include CD-ROM, floppy disk, tape, flash memory, system memory, and hard drive. Additionally, a data signal embodied in a carrier wave may be the computer readable storage medium. The local switch further includes subsystems such as a central processor 122, one or more network ports 124. In FIG. 3 the network ports are shown grouped into Virtual LANs (VLANs) 126. Other switches, routers, or network devices suitable for use with the invention may include additional or fewer subsystems.

FIG. 3 depicts some of the functionality included in the switch. The processor executes a management interface module 128, for example an http interface that permits management of the switch from remote locations connected to the network such as Admin computer 18 coupled to the network at a location remote from Local Switch 12. In the presently described embodiment, the Admin can log on to the switch from any workstation on the network and initiate a proxy_client module 130 that will perform the tests depicted in FIG. 2 as a proxy for the client computer A. Also, in this embodiment the switch provides this set of steps in an integrated manner, so that the Admin can perform the checks easily through the switch management interface. The switch may optionally include a switch_analysis module 132 having functions described below.

FIG. 4 is a flow chart depicting the steps required to initiate the simplified troubleshooting feature of the switch and the events that happen at the switch. The Admin logs into the Local Switch, and selects the option for troubleshooting. The switch is aware of where the clients are connected, their IP addresses and Media Access Control (MAC) addresses. The Admin selects the client having the problem, and selects option for connectivity tests, and enters the server's host name that the client computer cannot reach.

In order to ensure that the results are exactly those that would be encountered by the client PC, the switch must disable the client port, so that traffic from the client does not interfere with the tests. When the administrator requests this troubleshooting functionality, the following things happen on the switch;

-   -   A proxy client module is initialized with an IP address and a         MAC address of the client, and its associated VLAN as well as IP         subnet information. The switch also has knowledge of the DNS         server through its own configuration, while the rest of the         information is gathered from snooping DHCP packets.     -   The proxy_client module interacts with the IP stack through         internal Layer 3 interface, and performs the ping, L3         traceroute, and L2 traceroute as described in the flowchart. The         L2 traceroute is performed with the source MAC address of the         client, and the destination MAC address of the router.     -   Based on the results of the tests, the proxy_client module         interfaces with switch_analysis module to perform the tasks in         block H.

The switch Instantiates the proxy_client module that proxies for the client PC. The proxy_client module has a logical interface on the VLAN on which the client is connected, and assigns the interface the same IP address and MAC address as the client PC. The IP address and MAC addresses are learned through snooping of DHCP or ARP packets involving the client PC. The switch must use the client IP and MAC addresses, and the logical interface in the same VLAN as the client. This will ensure that the packets originating from the switch will traverse the same path as if they had originated from the client, which is necessary to ensure that the test results will point out the problems encountered by the client.

In the flow chart of FIG. 2, the steps in the decision blocks are performed automatically by the proxy_client module when it is instantiated. The steps listed in rectangle B are also performed by the proxy_client module as part of the IP connectivity block. The diagnostic analysis listed in rectangles A and C-G are performed by the Admin or some other diagnostic software not resident in the switch. The analysis of block H can be performed by switch_analysis module as described below.

The results of the tests can be put into can be put into three categories:

-   -   1. the problem is on some other device in the network where the         device is identified by tests like ping/traceroute etc. OR;     -   2. the problem is on the switch OR;     -   3. the problem is on a link on the switch.

Accordingly, the switch can provide additional information for each of the following conditions:

-   -   Block A—the Admin can use the switch to capture the packets         being generated from the Client. This can be used to do further         troubleshooting.     -   Block B—the switch can perform the necessary set of tests to         check the connectivity to the DNS server using the same         algorithms as in the flow chart.     -   Block D, E & F—It is possible for network management application         to use the information reported by the switch and perform         further diagnosis on the exact device in the network where the         problem is occurring.     -   Block H—the switch_analysis module can perform extensive checks         and report the exact problem in most cases. Link level packet         error statistics, and cable diagnostic tests can be used by the         switch to determine if the problem is a cabling error or network         adapter problem. Analysis of the switch hardware state and the         state of different features can also help determine if problem         is due to issues on the switch itself.

In one embodiment, the switch-analysis module performs the following functions:

-   -   Disable the proxy_client, so that the switch can initiate tests         with the client PC to check for problems on the client.     -   The port on which the router is to be reached is checked for         errors. If there are no errors, then the client side port is         checked.     -   The port on which the client is connected is checked for errors.     -   Cable diagnostics tests are run to see if the cable has any         problems.     -   The switch pings the client to see if the IP stack on the client         is responsive or not.     -   If no problems are found, then features on the switch (such as         access control lists) are checked to report all the types of         traffic that the switch would not forward from the client PC.

This capability of the switch_analysis module added to the proxy_client module not only performs tasks that the user would otherwise have been required to do at the client station, it also integrates the results of the tests with knowledge of the network present within the switch, and as seen by the switch, to help get to the root cause of the connectivity problems quicker.

The invention has now been described with reference to the preferred embodiments. Alternatives and substitutions will now be apparent to persons of skill in the art. Accordingly, it is not intended to limit the invention except as provided by the appended claims. 

1. A method for troubleshooting a connectivity problem between a client computer and an end station on a network, with the client computer having a client Internet Protocol (IP) address and a client Media Access Control (MAC) address, and with the client computer directly connected to a port of a local switch, said method, implemented at the local switch, comprising: providing a management interface to a computer coupled to the network; initiating a proxy_client module, in response to commands received on the management interface, that runs a sequence of connectivity tests with the end station utilizing a logical interface with the client IP address and MAC address as the source addresses; and returning the results of the sequence of connectivity tests via the management interface.
 2. The method of claim 1 where the port is part of a VLAN, the method further comprising: creating the logical interface on the VLAN including the port to which the client computer is connected, where the logical interface takes on attributes of the client computer.
 3. The method of claim 2 where the attributes taken on by the logical interface include the client IP and MAC addresses
 4. The method of claim 1 further comprising: determining whether the problem is on the switch, or on a device in the network other that the switch, or on a link on the switch.
 5. The method of claim 4 further comprising: if packets are dropped at the switch, identifying the reason for the packets being dropped; and identifying the feature responsible for dropping the packets, such as failure to respond to ARP queries.
 6. The method of claim 4 further comprising: if packet are dropped or showing errors on a link of the switch, then testing a link where the packets are dropped or showing errors.
 7. A network device comprising: a client port for coupling the network device to a client computer, with the client computer having an Internet Protocol (IP) address and a Media Access Control (MAC) address; a network port for coupling the network device to a network; a memory storing computer program code including a management interface module and a proxy_client module; a processor, coupled to the memory, configured to execute the management interface module to allow a computer on the network to select a connected client computer and an end point host device and to initiate the proxy_client module which runs a series of tests to troubleshoot a connectivity problem between the client computer and end node device and where the proxy client module utilizes the IP and MAC addresses of the client computer as source addresses when running the series of tests.
 8. The network device of claim 7 where the client port is included in a VLAN and where the computer is configured to create a logical interface on the VLAN that takes on attributes of the client computer when running the series of tests.
 9. The network device of claim 7 where the memory stores a switch_analysis module and with the processor configured to identify the reason for the packets being dropped and, when packet are dropped, to identify the feature responsible for dropping the packets, such as failure to respond to ARP queries.
 10. The network device of claim 9 further configured to test a link where the errors are occurring when errors occur.
 11. A system included in a network device for troubleshooting a connectivity problem between a client computer and an end station on a network, with the client computer having a client Internet Protocol (IP) address and a client Media Access Control (MAC) address, and with the network device including a client port adapted to be directly connected to the client computer, said system, implemented at the local switch, comprising: means for providing a management interface to a computer coupled to the network; means for initiating a proxy_client module, in response to commands received on the management interface, that runs a sequence of connectivity tests with the end station utilizing a logical interface with the client IP address and MAC address as the source addresses; and means for returning the results of the sequence of connectivity tests via the management interface.
 12. The system of claim 11 where the client port is part of a VLAN, the system further comprising: means for creating the logical interface on the VLAN including the client port, where the logical interface takes on attributes of the client computer.
 13. The system of claim 12 where the attributes taken on by the logical interface include the client IP and MAC addresses
 14. The system of claim 13 further comprising: means for determining whether the problem is on the network device, or on a device in the network other that the network device, or on a link on the network device
 15. The system of claim 14 further comprising: means for identifying the reason for the packets being dropped if packets are dropped at the switch; and means for identifying the feature responsible for dropping the packets, such as failure to respond to ARP queries.
 16. The system of claim 15 further comprising: means for testing a link where the packets are dropped or showing errors.
 17. A computer program product, executed by a processor on a network device for troubleshooting a connectivity problem between a client computer and an end station on a network, with the client computer having a client Internet Protocol (IP) address and a client Media Access Control (MAC) address, and with the network device including a client port adapted to be directly connected to the client computer, said computer program process comprising: a computer usable medium having computer readable program code physically embodied therein, said computer program product further comprising: computer readable program code executed by the processor for providing a management interface to a computer coupled to the network; computer readable program code executed by the processor for initiating a proxy_client module, in response to commands received on the management interface, that runs a sequence of connectivity tests with the end station utilizing a logical interface with the client IP address and MAC address as the source addresses; and computer readable program code executed by the processor for returning the results of the sequence of connectivity tests via the management interface.
 18. The computer program product of claim 17 where the client port is part of a VLAN, the system further comprising: computer readable program code executed by the processor for creating the logical interface on the VLAN including the client port, where the logical interface takes on attributes of a connected client computer.
 19. The computer program product of claim 18 where the attributes taken on by the logical interface include the client IP and MAC addresses.
 20. The computer program product of claim 17 further comprising: computer readable program code executed by the processor for determining whether the problem is on the network device, or on a device in the network other than the network device, or on a link on the network device.
 21. The computer program product of claim 20 further comprising: computer readable program code executed by the processor for identifying the reason for the packets being dropped if packets are dropped at the network device; and computer readable program code executed by the processor for identifying the feature responsible for dropping the packets, such as failure to respond to ARP queries.
 22. The computer program product of claim 20 further comprising: computer readable program code executed by the processor for testing a link where the packets are dropped or showing errors. 